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Figure 4 



EcoRI 
I 

GAATTCGGCACGAGGGAAGCTCACCTCACGCCGGCGACGCCATCGCCATTCTTCCCACTA 
X + + + + + + go 

CTTAAGCCGTGCTCCCTTCGAGTGGAGTGCGGCCGCTGCGGTAGCGGTAAGAAGGGTGAT 

EFGTREAHLTPATPSPFFPL 

GCAGGGCCTCACAAGTACATCGCGCTCCTTCTGGTTGTCCTCTCATGGATCCTGGTCCAG 

61 + + + + + + 120 

CGTCCCGGAGTGTTCATGTAGCGCGAGGAAGACCAACAGGAGAGTACCTAGGACCAGGTC 

AGPHKYIALLLVVLSWILVQ 

AGGTGGAGCCTGAGGAAGCAGAAAGGCCCGAGATCATGGCCAGTCATCGGCGCAACGGTG 

121 + + + + + + 180 

TCCACCTCGGACTCCTTCGTCTTTCCGGGCTCTAGTACCGGTCAGTAGCCGCGTTGCCAC 

RWSLRKQKGPRSWPVIGATV 

GAGCAGCTGAGGAACTACCACCGGATGCACGACTGGCTTGTCGGGTACCTGTCACGGCAC 

181 + + + + + + 240 

CTCGTCGACTCCTTGATGGTGGCCTACGTGCTGACCGAACAGCCCATGGACAGTGCCGTG 

EQLRNYHRMHDWLVGYLSRH 

AGGACAGTGACCGTCGACATGCCGTTCACTTCCTACACCTACATCGCTGACCCGGTGAAT 
241 + + + + + + 300 

TCCTGTCACTGGCAGCTGTACGGCAAGTGAAGGATGTGGATGTAGCGACTGGGCCACTTA 

RTVTVDMPFTSYTYIADPVN 

GTCGAGCATGTCCTCAAGACTAACTTCACCAATTACCCCAAGGGAATCGTGTACAGATCC 
301 + + + + + + 360 

CAGCTCGTACAGGAGTTCTGATTGAAGTGGTTAATGGGGTTCCCTTAGCACATGTCTAGG 

VEHVLKTNFTNYPKGIVYRS 

TACATGGACGTGCTCCTCGGTGACGGCATCTTCAACGCCGACGGCGAGCTGTGGAGGAAG 
361 + + + + + + 420 

ATGTACCTGCACGAGGAGCCACTGCCGTAGAAGTTGCGGCTGCCGCTCGACACCTCCTTC 
YMDVLLGDGI FNADGELWRK 

CAGAGGAAGACGGCGAGTTTCGAGTTCGCCTCCAAGAACCTGAGGGATTTCAGCGCCATT 
421 + + + + + + 480 

GTCTCCTTCTGCCGCTCAAAGCTCAAGCGGAGGTTCTTGGACTCCCTAAAGTCGCGGTAA 
QRKTASFEFASKNLRDFSAI 
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GTGTTCAGAGAGTACTCCCTGAAGCTGTCGGGTATACTGAGCCAGGCATCCAAGGCAGGC 

481 + + + + -- -- + + 540 

CACAAGTCTCTCATGAGGGACTTCGACAGCCCATATGACTCGGTCCGTAGGTTCCGTCCG 

VFREYSLKLSGILSQASKAG 

AAAGTTGTGGACATGCAGGAACTTTACATGAGGATGACGCTGGACTCCATCTGCAAGGTT 

541 + + -- - + + + + 600 

TTTCAACACCTGTACGTCCTTGAAATGTACTCCTACTGCGACCTGAGGTAGACGTTCCAA 

KVVDMQELYMRMTLDS I CKV 

GGGTTCGGGGTCGAGATCGGCACGCTGTCGCCAGATCTCCCCGAGAACAGCTTCGCGCAG 
601 + + + + + + 660 

CCCAAGCCCCAGCTCTAGCCGTGCGACAGCGGTCTAGAGGGGCTCTTGTCGAAGCGCGTC 

GFGVEIGTLSPDLPENSFAQ 

GCGTTCGATGCCGCCAACATCATCATCACGCTGCGGTTCATCGACCCGCTGTGGCGCATC 
661 + + + + + + 720 

CGCAAGCTACGGCGGTTGTAGTAGTAGTGCGACGCCAAGTAGCTGGGCGACACCGCGTAG 
AFDAANII ITLRFIDPLWRI 

AAGAGGTTCTTCCACGTCGGGTCAGAGGCCCTCCTAGCGCAGAGCATCAAGCTCGTGGAC 
721 + + + + + + 780 

TTCTCCAAGAAGGTGCAGCCCAGTCTCCGGGAGGATCGCGTCTCGTAGTTCGAGCACCTG 

KRFFHVGSEALLAQS IKLVD 

GAGTTCACCTACAGCGTGATCCGCCGGAGGAAGGCCGAGATCGTCGAGGTCCGGGCCAGC 
781 + + + + + + 840 

CTCAAGTGGATGTCGCACTAGGCGGCCTCCTTCCGGCTCTAGCAGCTCCAGGCCCGGTCG 

EFTYSVIRRRKAEIVEVRAS 

GGCAAACAGGAGAAGATGAAGCACGACATCCTGTCACGGTTCATCGAGCTGGGCGAGGCC 
841 + + + + + + 900 

CCGTTTGTCCTCTTCTACTTCGTGCTGTAGGACAGTGCCAAGTAGCTCGACCCGCTCCGG 

GKQEKMKHDILSRFIELGEA 

GGCGACGACGGCGGCGGCTTCGGGGACGATAAGAGCCTCCGGGACGTGGTGCTCAACTTC 
901 + + + + + + 960 

CCGCTGCTGCCGCCGCCGAAGCCCCTGCTATTCTCGGAGGCCCTGCACCACGAGTTGAAG 



GDDGGGFGDDKSLRDVVLNF 

GTGATCGCCGGGCGGGACACGACGGCGACGACGCTGTCGTGGTTCACGCACATGGCCATG 
+ + + + + + 1020 

CACTAGCGGCCCGCCCTGTGCTGCCGCTGCTGCGACAGCACCAAGTGCGTGTACCGGTAC 
VIAGRDTTATTLSWFTHMAM 
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TCCCACCCGGACGTGGCCGAGAAGCTGCGCCGCGAGCTGTGCGCGTTCGAGGCGGAGCGC 

1021 + + + + -- + -+ 1080 

AGGGTGGGCCTGCACCGGCTCTTCGACGCGGCGCTCGACACGCGCAAGCTCCGCCTCGCG 

SHPDVAEKLRRELCAFEAER 

GCGCGCGAGGAGGGCGTCACGCTCGTGCTCTGCGGCGGCGCTGACGCCGACGACAAGGCG 

1081 + + + + + + 1140 

CGCGCGCTCCTCCCGCAGTGCGAGCACGAGACGCCGCCGCGACTGCGGCTGCTGTTCCGC 

AREEGVTLVLCGGADADDKA 

TTCGCCGCCCGCGTGGCGCAGTTCGCGGGCCTCCTCACCTACGACAGCCTCGGCAAGCTG 

1141 + + + + + + 1200 

AAGCGGCGGGCGCACCGCGTCAAGCGCCCGGAGGAGTGGATGCTGTCGGAGCCGTTCGAC 

FAARVAQFAG LLTYDSLGKL 

GTCTACCTCCACGCCTGCGTCACCGAGACGCTCCGCCTGTACCCCGCCGTCCCTCAGGAC 

1201 + + + + + + 1260 

CAGATGGAGGTGCGGACGCAGTGGCTCTGCGAGGCGGACATGGGGCGGCAGGGAGTCCTG 

VYLHACVTETLRLYPAVPQD 

CCCAAGGGGATCCTGGAGGACGACGTGCTGCCGGACGGGACGAAGGTGAGGGCCGGCGGG 

1261 + + + + + + 1320 

GGGTTCCCCTAGGACCTCCTGCTGCACGACGGCCTGCCCTGCTTCCACTCCCGGCCGCCC 

PKGILEDDVLPDGTKVRAGG 

ATGGTGACGTACGTGCCCTACTCGATGGGGCGGATGGAGTACAACTGGGGCCCCGACGCG 

1321 + - - + --- + + + + 1380 

TACCACTGCATGCACGGGATGAGCTACCCCGCCTACCTCATGTTGACCCCGGGGCTGCGC 

MVTYVPYSMGRMEYNWGPDA 

GCGAGCTTCCGGCCGGAGCGGTGGATCAACGAGGATGGCGCGTTCCGCAACGCGTCGCCG 
1381 + + + + + + 1440 

CGCTCGAAGGCCGGCCTCGCCACCTAGTTGCTCCTACCGCGCAAGGCGTTGCGCAGCGGC 

ASFRPERWINEDGAFRNASP 

TTCAAGTTCACGGCGTTCCAGGCGGGGCCGAGGATCTGCCTGGGCAAGGACTCGGCGTAC 
1441 + + + + + + 1500 

AAGTTCAAGTGCCGCAAGGTCCGCCCCGGCTCCTAGACGGACCCGTTCCTGAGCCGCATG 
FKFTAFQAGPRI CLGKDSAY 

CTGCAGATGAAGATGGCGCTGGCCATCCTCTTCCGCTTCTACAGCTTCCGGCTGCTGGAG 

1501 + + + + + -+ 1560 

GACGTCTACTTCTACCGCGACCGGTAGGAGAAGGCGAAGATGTCGAAGGCCGACGACCTC 

L.QMKMALAILFRFYS FRLLE 
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GGGCACCCGGTGCAGTACCGCATGATGACCATCCTCTCCATGGCGCACGGCCTCAAGGTC 

1561 + + + + + + 1620 

CCCGTGGGCCACGTCATGGCGTACTACTGGTAGGAGAGGTACCGCGTGCCGGAGTTCCAG 

GHPVQYRMMTILSMAHGLKV 

CGCGTCTCTAGGGCCGTCTGATGTCATGGCGATTTGGATATGGATATCGTCCCGCTTAAT 

1621 + + + + + + 1680 

GCGCAGAGATCCCGGCAGACTACAGTACCGCTAAACCTATACCTATAGCAGGGCGAATTA 

RVSRAV* CHGDLDMDIVPLN 

CCACGACAAATAACGCTCGTGTTACAAATTTGCATGCATGCATGTAAGGGAAAGCGATGG 

1681 + + + - + + + 1740 

GGTGCTGTTTATTGCGAGCACAATGTTTAAACGTACGTACGTACATTCCCTTTCGCTACC 

PRQITLVLQICMHACKGKRW 

GTTTCATTGGTGGCTTGGCTTAAGCCTTAAAAACTCCGTCGGGTCTTGCGAACCACCACA 

1741 + + + + + + 1800 

CAAAGTAACCACCGAACCGAATTCGGAATTTTTGAGGCAGCCCAGAACGCTTGGTGGTGT 

VSLVAWLKP* 

TCACTAGTGTTTTGTACTCTACTCCTCAGTGGAAGTGTAGTGACAGCATACAAGTTCATC 
1801 + + + + + + i860 

AGTGATCACAAAACATGAGATGAGGAGTCACCTTCACATCACTGTCGTATGTTCAAGTAG 

Xhol 
I 

ATATATATTATCCTCTTTCTTAAAAAAAAAAAAAAAAAAACTCGAG 
1861 + + + + 1906 



Figure 5 

1 GAATTCCAAG CGAGGCCCTT GTAGCAGAGA GTGTTGCTGA TGCAGTCGGC 
51 GGAAATGAGT GCGTGCTGAG AGCAACGCTG AGGGGTTCCA GGGATGGCAA 
101 TGGCTATGGC AATCGGCTAG AGGTGGAGGA CAAGGTGGTG AGGATTGGGA 
151 GGGCAACCTA TGGCAAGTTG GTGAAGAGGC ACGCAATGAG AGATCTATTC 
201 AGACTTACAC TGGATGCCGC CAACAAATTC AACCTTTAGA TTTTGATACT 
251 GTCACTCCTA CTTTATTCCT TGGTTGGGCA ACTTCCAATA GGCTCATGTT 

3 01 AATCAATGAT TAGTGATTAT TCAGCAAATA TTCTTGTTTG TTTGACATTT 
351 ATAATATGTG GGGTGAGACG GATTAAATAT CATCCATGAG AGCTTTATCT 
401 TCATGCTCTC TTGATTTTGG TTTCAGATCA TTCTTTCAGT GTTCACAAGA 

4 51 ATTTTCTCAG TTTGGTCCAT GTAATTTTTG AAGTGAGGTT CCTTAAATTT 

5 01 CATTATGCTT CCTTTCTTTT CTAGACTAGC AACTGCATGA CTTTTCACTT 
551 TGGGTTCACA AATTGACTCA CAAGAAAACA AATTCACTTT TGGGTTCACA 
601 AATTCCTCTT CAGGATGTAC TTTTCACTTG AACTGTCATG TATAGGAACA 
651 AGGAATGGCT CAGTTTTTAA GGAACAATGT ACAGATTTCA TTTCAGAACT 
701 CTTTCTGGTT GGTTGAGTTT CAGACTTTTT GTACCAAGCT GATGGATCAC 
751 AATACTTGTT TCCAAAGTCT GATAACAGAA ACTGGCAACT CCTAATTGAT 
801 AATAAAAAGA ATAAAATACA GTATCAGATA TCTCATTTTC TTGGTTGGCA 
851 GATCACAAAA AGGAACACAA AGGCTAAGCC TCCTACTTGT TCGGGAGTTA 
901 GGTCAGGGAC ACCATATGAA TGAAAGAAAT CTTAATTTGG GGTCACACCA 
951 AGATTGTCTC TCTCGAGGTT GGGGGGTCCC TAAGGTTGGT AGTAGCAATA 

1001 CCCAATATAT CACCTAACAA ACCCAATCCA TGCTACATAC ATACATAGCA 
1051 TCCATCACTT GTAGACTGGA CCCTTCATCA AGAGCACCAT GGAGGAAGCT 
1101 CACATCACGC CGGCGACGCC ATCGCCATTC TTCCCACTAG CAGGGCCTCA 
1151 CAAGTACATC GCGCTCCTCC TGGTTGTCCT CTCATGGATC CTGGTCCAGA 
12 01 GGTGGAGCCT GAGGAAGCAG AAAGGCCCGA GATCATGGCC AGTCATCGGT 
1251 GCAACGGTGG AGCAGCTGAG GAACTACCAC CGGATGCACG ACTGGCTTGT 
1301 CGGGTACCTG TCACGGCACA GGACAGTGAC CGTCGACATG CCGTTCACTT 
1351 CCTACACCTA CATCGCTGAC CCGGTGAATG TCGAGCATGT CCTCAAGACT 



Figure 5B 

14 01 AACTTCACCA ATTACCCCAA GGTAAATGAC CTGAACTCAC TGATGTTCAG 

1451 TCTTCGGAAA TCAGAGCTGA AAGCTGAATC GAATGTGCCT GAACACCGTG 

1501 TAGGGAATCG TGTACAGATC CTACATGGAC GTGCTCCTCG GTGACGGCAT 

1551 CTTCAACGCC GACGGCGAGC TGTGGAGGAA GCAGAGGAAG ACGGCGAGTT 

1601 TCGAGTTCGC CTCCAAGAAC CTGAGGGATT TCAGCGCCAT TGTGTTCAGA 

1651 GAGTACTCCC TGAAGCTGTC GGGTATACTG AGCCAGGCAT CCAAGGCAGG 

1701 CAAAGTTGTG GACATGCAGG TGAGATCACT GCTCCCTTGC CATTGCCAAC 

1751 ATGAGCATTT CAACCTGAGA CACGAGAGCT ACCTTGCCGA TTCAGGAACT 

1801 TTACATGAGG ATGACGCTGG ACTCCATCTG CAAGGTTGGG TTCGGGGTCG 

1851 AGATCGGCAC GCTGTCGCCG GATCTCCCCG AGAACAGCTT CGCGCAGGCG 

1901 TTCGATGCCG CCAACATCAT CGTCACGCTG CGGTTCATCG ACCCGCTGTG 

1951 GCGCATCAAG AGGTTCTTCC ACGTCGGGTC AGAGGCCCTC CTAGCGCAGA 

2001 GCATCAAGCT CGTGGACGAG TTCACCTACA GCGTGATCCG CCGGAGGAAG 

2051 GCCGAGATCG TCGAGGCCCG GGCCAGCGGC AAACAGGAGA AGGTACGTGC 

2101 ACATGACTGT TTCGATTCTT CAGTTCATCG TCTTGGCCGG GATGGACCTG 

2151 ATCCTGATTG ATTATATATC CGTGTGACTT GTGAGGACAA ATTAAAATGG 

2201 GCAGATGAAG CACGACATCC TGTCACGGTT CATCGAGCTA GGCGAGGCCG 

2251 GCGACGACGG CGGCGGCTTC GGGGACGACA AGAGCCTCCG GGACGTGGTG 

2301 CTCAACTTCG TGATCGCCGG GCGGGACACG ACGGCGACGA CGCTGTCGTG 

2351 GTTCACGCAC ATGGCCATGT CCCACCCGGA CGTGGCCGAG AAGCTGCGCC 

2401 GCGAGCTGTG CGCGTTCGAG GCGGAGCGCG CGCGCGAGGA GGGCGTCGCG 

2451 CTCGTGCCCT GCGGCGGCGC TGACGCCGAC GACAAGGCGT TCGCCGCCCG 

2501 CGTGGCGCAG TTCGCGGGCC TCCTCACCTA CGACAGCCTC GGCAAGCTGG 

2551 TCTACCTCCA CGCCTGCGTC ACCGAGACGC TCCGCCTGTA CCCCGCCGTC 

2601 CCTCAGGTGA GCGCGCCCGA CACGCGACCT CCGGTCCAGA GCACAGCATG 

2651 CAGTGAGTGG ACCTGAATGC AATGCACATG CACTTGCGCG CGCGCAGGAC 

2701 CCCAAGGGGA TCCTGGAGGA CGACGTGCTG CCGGACGGGA CGAAGGTGAG 

2751 GGCCGGCGGG ATGGTGACGT ACGTGCCCTA CTCGATGGGG CGGATGGAGT 
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2801 ACAACTGGGG CCCCGACGCG GCGAGCTTCC GGCCGGAGCG GTGGATCAAC 

2851 GAGGATGGCG CGTTCCGCAA CGCGTCGCCG TTCAAGTTCA CGGCGTTCCA 

2901 GGCGGGGCCG AGGATCTGCC TGGGCAAGGA CTCGGCGTAC CTGCAGATGA 

2951 AGATGGCGCT GGCCATCCTC TTGCGCTTCT ACAGCTTCCG GCTGCTGGAG 

3001 GGGCACCCGG TGCAGTACCG CATGATGACC ATCCTCTCCA TGGCGCACGG 

3051 CCTCAAGGTC CGCGTCTCTA GGGCCGTCTG ATGTCATGGC GATTTGGGAT 

3101 ATCATCCCGC TTAATCCTTA AAAATTTGCA TGCATGCATG TAAGGGAAAG 

3151 CGATGGGTTT CATTGGTGGC TTGGCTTAAG CCTTAAAAAC TCCGTCGGGT 

3201 CTTGCGAACC ACCACATCAC TAGTGTTTTG TACTCTACTG CTCAGTGGAA 

3251 GTGTAGTGAC AGCATACAAG TTCATCATAT ATATTATCCT CTTTCTTCGC 

3301 CGGATGCTTC CCGGGACCTT TTGGAGACCA TTACTGACAG GCGTGTGAAA 

3351 AAAAGGCTTC TTCTGCGGCG AAGTTTTGGG TTCAGAGTCT TGGCGTCTTT 

%J 3401 GCAGCAGAAA AAAGGTTTGG AAGGATCTGA ACCCTGAACC GAAAATGGCT 

5 " 

Ls. 3451 TCGGAAATAT GCTCGCATCG GGGCGGGGCC GTCACTCGGG ATGACGACAA 

3501 GCCCACAAGC AGTGAGAGCG AAGCGATCTT TGGAGTTTGG AGACACTCTC 

3551 GGACCCCTCG GCGCTCCGCG AGCTCATCTT CGCCTCCTCT GTCGTGTCCG 

3601 TGGCGGCACC GCGCCCGCCC GCCTCGTGTT CGACCAAATC CCGCGCCCCG 

3651 ACCGGTTCGT GTACAACACC CTCATCCGCG GCGCCGCGCG CAGTGACACG 

3701 CCCCGGGACG CCGTATACAT CTATAAATCA TGGTATTGTA CTTTATTTTC 

3751 AAACGGCCTT AACACAACCA TATTTTTATG GTAAACACGT TCAAAATTGA 

3801 CACAAATTTA AAACAGGCAC AAACCGTAGC TAAACATAAG AGAATGAGAG 

3851 ACAACCCAAA GGTTAGAGAT GAAATAAGCT GAGTAAACGA CGAATTC 
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2501 CGTGGCGCAGTTCGCGGGCCTCCTCACCTACGACAGCCTCGGCAAGCTGG 2550 

MlllllllllllllillllllMIIIIIIIMIIIIIIIlllMlllll 

1152 CGTGGCGCAGTTCGCGGGCCTCCTCACCTACGACAGCCTCGGCAAGCTGG 1201 
2551 TCTACCTCCACGCCTGCGTCACCGAGACGCTCCGCCTGTACCCCGCCGTC 2600 

M MIIIIIIIIIIIIIIIIMIIIIIIIIIIMMIIIIIIIIIMIII 

1202 TCTACCTCCACGCCTGCGTCACCGAGACGCTCCGCCTGTACCCCGCCGTC 1251 
2601 CCTCAGGTGAGCGCGCCCGACACGCGACCTCCGGTCCAGAGCACAGCATG 2650 
1252 CCT 1254 

2651 CAGTGAGTGGACCTGAATGCAATGCACATGCACTTGCGCGCGCGCAGGAC 2700 

1255 111111 

X ' ! " CAGGAC 1260 

2701 CCCAAGGGGATCCTGGAGGACGACGTGCTGCCGGACGGGACGAAGGTGAG 2750 

MMiimiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiimii 
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illMIMIIIIIIIIIIMIIIIIIIMIlllMiiiiiiiiiiiiiii 

1461 GGCGGGGCCGAGGATCTGCCTGGGCAAGGACTCGGCGTACCTGCAGATGA 1510 
2951 AGATGGCGCTGGCCATCCTCTTGCGCTTCTACAGCTTCCGGCTGCTGGAG 3000 

NlllilllMIIIIIIIIIII IIIIIMIMMIIMMIIIIIIMI 

1511 AGATGGCGCTGGCCATCCTCTTCCGCTTCTACAGCTTCCGGCTGCTGGAG 1560 
3001 GGGCACCCGGTGCAGTACCGCATGATGACCATCCTCTCCATGGCGCACGG 3050 

, M "IIIIIIMIIIIIIIMllMliiiiiiiiiiiiiiiiiiiiiiiiii 

1561 GGGCACCCGGTGCAGTACCGCATGATGACCATCCTCTCCATGGCGCACGG 1610 
3051 CCTCAAGGTCCGCGTCTCTAGGGCCGTCTGATGTCATGGCGATTTG 3096 

HI II Ml MIIIIIIIIMIIIIMIIIIIIMIIII III 11111"'" 

1611 CCTCAAGGTCCGCGTCTCTAGGGCCGTCTGATGTCATGGCGATTTGGATA 1660 

3097 . GGATATCATCCCGCTTAATCC ' TTAAAAATT hoc 

IIMIII llllllll |T^»TT31 2 6 

1661 TGGATATCGTCCCGCTTAATCCACGACAAATAACGCTCGTGTTACAAATT 1710 
3127 TGCATGCATGCATGTAAGGGAAAGCGATGGGTTTCATTGGTGGCTTGGCT 3176 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMMIIII 

1711 TGCATGCATGCATGTAAGGGAAAGCGATGGGTTTCATTGGTGGCTTGGCT 1760 



Figure 6D 



3177 TAAGCCTTAAAAACTCCGTCGGGTCTTGCGAACCACCACATCACTAGTGT 3226 

1 1 f 1 1 1 1 1 1 1 1 ! I f 1 1 i 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i f 1 1 1 1 1 1 

1761 TAAGCCTTAAAAACTCCGTCGGGTCTTGCGAACCACCACATCACTAGTGT 1810 
3227 TTTGTACTCTACTCCTCAGTGGAAGTGTAGTGACAGCATACAAGTTCATC 3276 

1 i 1 1 1 1 1 1 f 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 i i 1 1 1 f 1 i 1 1 

1811 TTTGTACTCTACTCCTCAGTGGAAGTGTAGTGACAGCATACAAGTTCATC 1860 
3277 ATATATATTATCCTCTTTCTTCGCCGGATGCTTCCCGGGACCTTTTGGAG 3326 

1 1 1 1 1 1 M 1 1 ! 1 1 1 1 ! ! I i 1 1 I M 

1861 ATATATATTATCCTCTTTCTTAAAAAAAAAAAAAAAAAAACTCGAG . . . . 1906 
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Figure 7 



Figure 8 

1 GAATTCCAAG CGAGGCCCTT GTAGCAGAGA GTGTTGCTGA TGCAGTCGGC 
51 GGAAATGAGT GCGTGCTGAG AGCAACGCTG AGGGGTTCCA GGGATGGCAA 
101 TGGCTATGGC AATCGGCTAG AGGTGGAGGA CAAGGTGGTG AGGATTGGGA 
151 GGGCAACCTA TGGCAAGTTG GTGAAGAGGC ACGCAATGAG AGATCTATTC 
201 AGACTTACAC TGGATGCCGC CAACAAATTC AACCTTTAGA TTTTGATACT 
251 GTCACTCCTA CTTTATTCCT TGGTTGGGCA ACTTCCAATA GGCTCATGTT 
301 AATCAATGAT TAGTGATTAT TCAGCAAATA TTCTTGTTTG TTTGACATTT 
351 ATAATATGTG GGGTGAGACG GATTAAATAT CATCCATGAG AGCTTTATCT 
401 TCATGCTCTC TTGATTTTGG TTTCAGATCA TTCTTTCAGT GTTCACAAGA 
451 ATTTTCTCAG TTTGGTCCAT GTAATTTTTG AAGTGAGGTT CCTTAAATTT 
501 CATTATGCTT CCTTTCTTTT CTAGACTAGC AACTGCATGA CTTTTCACTT 
551 TGGGTTCACA AATTGACTCA CAAGAAAACA AATTCACTTT TGGGTTCACA 
601 AATTCCTCTT CAGGATGTAC TTTTCACTTG AACTGTCATG TATAGGAACA 
651 AGGAATGGCT CAGTTTTTAA GGAACAATGT ACAGATTTCA TTTCAGAACT 
701 CTTTCTGGTT GGTTGAGTTT CAGACTTTTT GTACCAAGCT GATGGATCAC 
751 AATACTTGTT TCCAAAGTCT GATAACAGAA ACTGGCAACT CCTAATTGAT 
8 01 AATAAAAAGA ATAAAATACA GTATCAGATA TCTCATTTTC TTGGTTGGCA 
851 GATCACAAAA AGGAACACAA AGGCTAAGCC TCCTACTTGT TCGGGAGTTA 
901 GGTCAGGGAC ACCATATGAA TGAAAGAAAT CTTAATTTGG GGTCACACCA 
951 AGATTGTCTC TCTCGAGGTT GGGGGGTCCC TAAGGTTGGT AGTAGCAATA 
1001 CCCAA TATAT CA CCTAACAA ACCCAATCCA TGCTACATAC ATACATAGCA 
1051 TCCATCACTT GTAGACTGGA CCCTTCATCA AGAGCACCAT GG 
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Figure 12 



1 GGAA 4 

I I 

201 C CGG ATGCACGACTGGCTTGTCGGGTAC CTGTCACGGCACAGGACAGTGA 250 
5 TTCGGCTTATGCCGTTCACTTCCTACACCTACATCGCTGACCCGGTGAAT 54 

M M 11 1 M 1 M M 1 1 1 ! ! 1 1! I ! ! I i M I S i 1 1 1 ! 1 1 ! 1 1 1 

251 CCGTCGACATGCCGTTCACTTCCTACACCTACATCGCTGACCCGGTGAAT 300 
55 GTCGAGCATGTCCTCAAGACTAACTTCACCAATTACCCCAAGGGGGACGT 104 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 !! 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 Ml 

3 01 GTCGAGCATGTCCTCAAGACTAACTTCACCAATTACCCCAAGGGAATCGT 350 
105 GTACAGATCCTACATGGATGTGCTCCTCGGTGACGGCATATTCAACGCTG 154 

IMIIIMMMIIMM llllllllllllllllllll llllllll I 

351 GTACAGATCCTACATGGACGTGCTCCTCGGTGACGGCATCTTCAACGCCG 4 00 
155 ACGGCGAGCTGTGGAGGAAGCAGAGGAAGACGGCGAGTTTCGAGTTCGCC 204 

MMMMMMMMMMIMMMMMMMMMMIMMMM 

401 ACGGCGAGCTGTGGAGGAAGCAGAGGAAGACGGCGAGTTTCGAGTTCGCC 450 
205 TCCAAGAACCTGAGGGATTTCAGTGCCAATGTTTTCAGAGAGTACTCCCT 254 

IIMIIIII1IIIIIMIIIIII 1111 III i 1 1 1 1 1 M II 1 1 1 1 !! I 

451 TCCAAGAACCTGAGGGATTTCAGCGCCATTGTGTTCAGAGAGTACTCCCT 500 
255 GAAGCTGTCGGGCATACTGAGTCAGGCATCCAAGGCAGGCAAAGTTGTTG 3 04 

1 1 II i 1 1 1 M M 1 1 1 1 ! M i IMIMMIMMIIIMMIIIIII I 

501 GAAGCTGTCGGGTATACTGAGCCAGGCATCCAAGGCAGGCAAAGTTGTGG 550 
305 ACATGCAGGAACTTTACATGAGGATGACACTGGACTCGATCTGCAANGTT 354 

IMIIIMIIIIIIIMMIIIIMIII llllllll lllllllhlll 

551 ACATGCAGGAACTTTACATGAGGATGACGCTGGACTCCATCTGCAAGGTT 600 
355 GGGTTCGGGGTCNANATCGGCACGCTGTCNCCGGATCTCCCCGAGAACAG 404 

1 1 1 M I II I i M 1 1 1 1 1 1 M I M 1 1 1 h 1 1 I ! 1 1 1 ! I II M 1 1 1 1 1 1 

601 GGGTTCGGGGTCGAGATCGGCACGCTGTCGCCAGATCTCCCCGAGAACAG 650 
4 05 CTTCNCCCAAGCGTTCGATGCCGCTAACATCATCGTCACNCTGCGGTTCA 454 

lllhl 1 1 IIIIIIIIMIMI lllllllll MMMIMMMM 

651 CTTCGCGCAGGCGTTCGATGCCGCCAACATCATCATCACGCTGCGGTTCA 700 
455 TCCACCCNCTGTGGCGCATCCAGAAGTTCTTCCCCNGTCA 4 94 

I! MihMIIIIIIIIM Mi llllllll h 

701 TCGACCCGCTGTGGCGCATCAAGAGGTTCTTCCACGTCGGGTCAGAGGCC 750 

Percent Similarity: 92.510 Percent Identity. 90.891 
Sb200-Sorghr. Pep x Sb20081.Pep February 13, 1997 11:29 .. 



5 MPFTSYTYIADPVNVEHVLKTNFTNYPKGDVYRSYMDVLLGDGIFNADGE 54 

I II M I M I M II II II II I II I I I II II llllllllllllllllllll 
87 MPFTSYTYIADPVNVEHVLKTNFTNYPKGIVYRSYMDVLLGDGIFNADGE 136 

55 LWRKQRKTASFEFASKNLRDFSANVFREYSLKLSGILSQASKAGKVVDMQ 104 

I M 1 1 1 1 M II M 1 1 II M M M MMMMMMMMMMIMM! 

137 LWRKQRKTAS FEFAS KNLRD FSAI VFREYSLKLSG I LSQASKAGKWDMQ 186 



Figure 12B 



105 ELYMRMTLDSICXVGFGVXIGTLSPDLPENSFXQAFDAANIIVTLRFIHP 154 

illinium imi iiiiiimiui miiiiimiiiM 

187 ELYMRMTLDSICKVGFGVEIGTLSPDLPENSFAQAFDAANIIITLRFIDP 236 
155 LVJRIQKFF 162 

mimi 

237 LWRIKRFF 244 



